(54) Method and system for silence removal in voice communication



(57) A method and an apparatus for removing the silence from
the digitized voice signals conveyed through packet or cell
switching networks. The silence samples are neither packetized
nor sent over the network but are regenerated at the output of
the network. The silence samples generated are white noise
samples, whose level is adapted to the background noise of the
silence samples received at the input node of the network. For
long periods of silence, the white noise level is periodically
refreshed to be adapted to the last silence samples received at
the input node of the network. The method also provides control
of packet or cell loss. The method may or may not use control
packets; in the latter case, it can be used for ATM networks
with AAL1.

The method is implemented as a program executed in a Digital
Signal Processor located on adapter cards dedicated to voice
processing in the network access nodes.
Description

FIELD OF THE INVENTION

This invention deals with the transport of voice information in
packet or cell switching networks such as Frame Relay or ATM
(Asynchronous Transfer Mode) networks, and particularly with
the function of silence removal.

BACKGROUND OF THE INVENTION 

In digital networks, voice is encoded with a PCM (Pulse Code
Modulation) coding system which provides a constant bit rate of
64 Kbits per second. This bit rate corresponds to the sampling
of the analog voice signal at the rate of 8,000 times per
second, each sample being represented by 8 bits. In order to
transport this voice signal across a packet or cell network,
individual 8-bit samples are assembled into packets.
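The arithmetic above can be checked with a short sketch. The 20 ms packet duration is an assumption borrowed from the GSM example given later in this description, and the function names are hypothetical.

```python
SAMPLE_RATE_HZ = 8000    # samples per second, as stated above
BITS_PER_SAMPLE = 8      # one PCM sample is one byte


def pcm_bit_rate(sample_rate_hz=SAMPLE_RATE_HZ, bits_per_sample=BITS_PER_SAMPLE):
    """Return the constant PCM bit rate in bits per second."""
    return sample_rate_hz * bits_per_sample


def packetize(samples: bytes, packet_ms: int = 20):
    """Split a byte string of 8-bit samples into fixed-duration packets.

    packet_ms is an assumed packet duration, not a value fixed by PCM itself.
    """
    per_packet = SAMPLE_RATE_HZ * packet_ms // 1000
    return [samples[i:i + per_packet] for i in range(0, len(samples), per_packet)]


one_second = bytes(SAMPLE_RATE_HZ)      # one second of (all-zero) samples
packets = packetize(one_second)
print(pcm_bit_rate(), len(packets), len(packets[0]))
```

With these assumptions, one second of voice yields 50 packets of 160 samples each.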

Packetized systems exploit the bursty nature of voice and data
traffic to multiplex the traffic of several users so that they
can share transmission bandwidth and switching resources. The
packet header includes the necessary control information,
allowing a wide range of coding schemes for voice and data and
thus easy integration of multimedia traffic (voice, video and
data). The 'packet networks' considered for the present
invention are either packet based, such as Frame Relay, or cell
based, such as Asynchronous Transfer Mode (ATM) networks.

One way of saving bandwidth is to reduce the bit rate required
from the 64 Kbps standard rate. Adaptive Differential PCM
(ADPCM) is a compression algorithm reducing the bit rate to
32 Kbps without measurable loss of quality: it encodes each
sample as the difference between it and the last sample, rather
than as an absolute amplitude value. The voice compression
principle relies on the fact that a voice signal has
considerable redundancy, which means that the general
characteristics of the next few samples can be predicted from
the last few samples. The Global System for Mobile
Communications (GSM) standard for the European Cellular
Telecommunications System is one example of a compression
algorithm. As well as saving bandwidth, the compression
algorithm must be good in terms of quality, which means that a
listener must not detect the difference between an original
analog signal and one that has been encoded and later decoded.

Because speech occurs in 'talk spurts' and is half-duplex (most
of the time only one person is talking), 60% of any (one-way)
voice conversation consists of silence. Another way of saving
bandwidth is to detect when there is no actual speech and simply
stop sending voice samples during the gaps. This silence removal
function consists in detecting silence and stopping the sending
of voice packets at the originating end, and generating a
background noise instead of silence at the terminating end. It
is more comfortable for the listener to hear a background noise
rather than pure silence, as this minimizes the discontinuity in
background noise between speech and silence. Careful selection
of the noise power is necessary to avoid the problem of 'noise
pumping', an annoying contrast between the background noise
during the silence period and the background noise during
speech spurts.

Most of the so-called comfort noise generators take into
account the quality requirement for the choice of the noise
power. The GSM standard for digital cellular telecommunications
provides a comfort noise generator embedded in its compression
algorithm. The parameters of the comfort noise are estimated on
the transmit side and transmitted to the receive side before
the radio transmission is cut, and at a regular low rate
afterwards. This allows the comfort noise to adapt to the
changes of the noise on the transmit side. The comfort noise
generated is of good quality, but this generator can only be
used with the GSM coding scheme: it cannot be used to add a
'comfort noise generator' to other voice coding algorithms such
as PCM (no compression), ADPCM, SBC, and CELP.

The G.764 standard is a CCITT Voice Packetization Protocol
which has been defined to operate with any voice compression
algorithm. In order to play out a background noise at the
terminating end, the level of noise is specified in a 4-bit
field of the packet header. An additional bit, called the more
(M) bit, is used to distinguish between gaps due to silence and
gaps due to missing or discarded cells or packets. This
solution provides a comfort noise of good quality and is
independent of the compression algorithm, but implies a 5-bit
overhead on each packet or cell transported. When used with ATM
cells, the additional 5 bits of voice header cannot be protected
with the ATM Adaptation Layer of type 1 (AAL1). Thus, the
Adaptation Layer of type 2 (AAL2), which is intended for
variable-rate information, or AAL5 needs to be used. There are
two disadvantages related to the use of AAL2 or AAL5 for ATM:
AAL2 is not yet fully defined by international standards, and
AAL5 introduces a large overhead.

It is an object of the present invention to provide a method
and a system for silence removal independent from the voice
coding or voice compression algorithms.

Another object of the present invention is to provide a method
and a system for silence removal which can be used in ATM
networks and is compatible with the ATM adaptation layer of
type one (AAL1).

It is another object of the present invention to provide such a
method and system wherein the overhead on the packets or cells
is minimized, while offering a good quality in terms of comfort
during speech.

The method according to the present invention for transporting
a stream of packets between a transmitting side and a receiving
side through a communication network, said stream including
voice packets corresponding to speech periods and silence
packets corresponding to silence periods, comprises: at the
transmitting side, detecting the ends of speech periods and
transmitting only the voice packets corresponding to the speech
periods, followed by the silence packets following the speech
periods for a first period of time; and at the receiving side,
reconstituting said stream by interleaving, between the voice
packets received from the transmitting side, white noise
packets corresponding to the background noise of the preceding
voice packets.
In a first embodiment it comprises:

at the transmitting side:

detecting the end of speech periods,

transmitting the silence packets during the first
period of time after the end of speech periods,

calculating, at the end of speech periods, a
white noise level corresponding to the background
noise of at least one preceding packet,

transmitting to the receiving side a control
packet including said white noise level,

at the receiving side:

receiving the voice, silence and control packets,

reading the white noise level in the control
packet to generate the white noise packets
which are interleaved between the voice packets
received from the transmitting side.

In a second embodiment it comprises:

at the transmitting side:

detecting the end of speech periods,

transmitting the silence packets during the first
period of time after the end of speech periods,

at the receiving side:

detecting the end of speech periods in the
received packets,

calculating a white noise level from the background
noise of at least the last received packet, and
generating the white noise packets to be interleaved
between the voice packets which are received from the
transmitting side.
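The level calculation and white noise generation named in both embodiments can be sketched as follows. This is only an illustration under stated assumptions: the text does not define how the level is measured, so the sketch uses the RMS of linear (non-companded) sample values, and the function names are hypothetical.

```python
import math
import random


def noise_level(samples):
    """Assumed level measure: RMS of linear sample values."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def white_noise_packet(level, n_samples=160, rng=None):
    """Generate n_samples of uniform white noise whose RMS is close to `level`.

    A uniform distribution on [-a, a] has an RMS of a / sqrt(3),
    so the amplitude bound is scaled accordingly.
    """
    rng = rng or random.Random()
    bound = level * math.sqrt(3)
    return [rng.uniform(-bound, bound) for _ in range(n_samples)]


# Measure the background level of a (toy) trailing silence packet,
# then regenerate noise at that level on the receiving side.
level = noise_level([3, -3, 3, -3])
regenerated = white_noise_packet(level, n_samples=160, rng=random.Random(1))
```

A real implementation would operate on the decoded samples of whatever voice coding is in use; the uniform distribution is one convenient choice of white noise, not one prescribed by the text.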

The silence removal function of the invention is independent of
the voice coding for ATM (Asynchronous Transfer Mode) networks,
and has a minimal dependence for non-ATM networks, which is
limited to the characterization of the control packet versus
the ordinary voice sample packets. As required, there is no
overhead on each packet transported in the network as with the
prior art: for non-ATM networks, only one control packet is
sent by the transmitting side; for ATM networks, there is no
overhead on each packet transported in the network and AAL1 can
be used. A packet loss control mechanism is also implemented.
Moreover, the processing resources used for the silence removal
function of the present invention are very low compared to
those necessary for the compression algorithm.

In the preferred embodiment of the invention the silence
removal method is implemented by a Digital Signal Processor. A
Voice Activity Detector (VAD) function is used to detect the
silence packets of the input voice packet stream. In the
telecommunications network access nodes, both the voice
compression and the silence removal functions may be
implemented in voice dedicated adapter cards, which also
include the usual adapter routing functions.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the voice signal and the corresponding voice
sample stream transported by the packet or cell network.

Figure 2 describes the comfort noise generated during silence
between two speech periods.

Figure 3 shows the white noise generation at a level
periodically refreshed during silence.

Figure 4 represents the process implemented when one packet is
lost.

Figure 5 describes the process implemented when more than one
packet is lost.

Figure 6 shows the flow chart of the silence removal function
wherein control packets are used, at the packet transmitting
side of the packet network.

Figure 7 shows the flow chart of the silence removal function
wherein control packets are used, at the packet receiving side
of the network.

Figure 8 shows the flow chart of the silence removal function
wherein no control packets are used, at the packet transmitting
side of the packet network.

Figure 9 shows the flow chart of the silence removal function
wherein no control packets are used, at the packet receiving
side of the packet network.

Figure 10 illustrates the implementation of the silence removal
function as a program running in an adapter card of a
telecommunications access node dedicated to voice processing.

DETAILED DESCRIPTION OF THE PREFERRED 
EMBODIMENT 

In Figure 1 are represented both the voice signal (10) and the
corresponding streams of voice packets (20, 40), entering a
packet or cell communication network (70) via an input
telecommunications access node (50), called the packet
transmitting side, and generated by the output
telecommunications access node (60), called the packet
receiving side, as well as the network packets (30) transported
through the network. The voice sample packets (20) enter the
network at a constant bit rate of 64 Kbps in the PCM mode.
Optionally, the voice samples are compressed in the input node
(50) and decompressed at the output node (60). As the voice
signal is a succession of speech periods (15, 35) and silences
(25, 45), the voice sample packets convey corresponding
high-level samples, called 'speech samples', and lower-level
background noise samples, called 'silence samples'. The general
object of the silence removal function in the access nodes (50
and 60) is not to send network packets for silence samples at
the input of the network (50), and to regenerate those silence
samples at the output of the network (60).

Figure 2 describes the solution of the present invention for
silence removal in a normal processing mode. The voice signal
(10), comprising speech and silence signals (15, 25), is coded
into voice sample packets (20) when entering the network. The
state of the silence removal operation is 'speech state on'
(150) until a silence is detected by a Voice Activity Detection
(VAD) function. At the packet transmitting side of the network
(input node 50), when the silence is detected by the VAD
function, the transmission goes on with the first silence
samples (120) of the network packets (30) during a defined
short first period of time (110), set at 80 ms (for GSM packets
this corresponds to 4 packets, as one packet takes 20 ms). This
period of time is chosen so as to be sufficient to analyze the
level of the background noise of the silence samples. At the
end of the 80 ms period, the state of the silence removal
operation is 'silence state on' (160). The voice sample packets
(40) generated out of the network by node (60) during speech
comprise only the voice packets as received at the input. After
the 80 ms period, and during the period of silence, white noise
samples (130) are generated at a level corresponding to the
background noise calculated during the defined period of time
mentioned above. When speech restarts (35), new network voice
packets (140) are sent over the network and voice packets (145)
are played at the output of the network. Figure 2 presents the
two possible ways of determining the level of energy E for the
white noise to be generated:

either the calculation is performed by the input node (packet
transmitting side): in this case the input node sends a control
packet (100) containing the level of white noise to be
generated; on reception of the control packet, the output node
generates the corresponding silence samples (130) with white
noise WE corresponding to the level E sent in the control
packet;
or no control packet (100) is sent and the network packet
stream sent over the network stops at the last silence packets
(120) of the period of time (110); the output node receiving
the network packets at the other side of the network has the
capacity (symmetrical to the input node capacity) to detect the
end of speech and to calculate the white noise level to be
generated. This kind of silence removal embodiment is
implemented in ATM network access nodes.

The characterization of the control packet depends on the voice
coding: for instance, with the 32-byte-long GSM packets, the
first byte being never set to FFFE, this value can be used to
characterize a control packet for a GSM coded voice sample
stream. The energy level conveyed in this control packet can
take the two following bytes. This information is protected in
the control packet by redundancy in order to avoid bad
adjustment of the generated white noise, which means it is
repeated on bytes 3 and 4 and bytes 5 and 6 of the 32 bytes of
the GSM packet.
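A control packet of this kind could be built and decoded roughly as sketched below. The exact byte offsets are ambiguous in the text, so the layout here is an assumption made for illustration only: a 16-bit marker 0xFFFE, followed by three big-endian copies of a 16-bit level, zero-padded to 32 bytes. The majority vote is one possible way to exploit the redundancy and is not described in the text; all function names are hypothetical.

```python
import struct
from collections import Counter

PACKET_LEN = 32     # GSM packet length given in the text
MARKER = 0xFFFE     # value said never to start a GSM voice packet
COPIES = 3          # assumed: the level plus its two repetitions


def build_control_packet(level: int) -> bytes:
    """Build a control packet using the assumed layout."""
    if not 0 <= level <= 0xFFFF:
        raise ValueError("level must fit in 16 bits")
    body = struct.pack(">H", MARKER) + struct.pack(">H", level) * COPIES
    return body + bytes(PACKET_LEN - len(body))


def is_control_packet(packet: bytes) -> bool:
    """A packet is treated as a control packet when it starts with the marker."""
    return len(packet) == PACKET_LEN and struct.unpack_from(">H", packet, 0)[0] == MARKER


def read_level(packet: bytes):
    """Return the level agreed on by at least two copies, or None if all differ."""
    copies = struct.unpack_from(">3H", packet, 2)
    value, votes = Counter(copies).most_common(1)[0]
    return value if votes >= 2 else None


packet = build_control_packet(1234)
damaged = bytearray(packet)
damaged[2] ^= 0xFF                      # corrupt one copy of the level
print(read_level(packet), read_level(bytes(damaged)))
```

With three copies, a single corrupted copy still leaves two that agree, which is the point of repeating the level.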

Figure 3 illustrates the continuous monitoring of the white
noise level. A second period of time (200) is defined, of 10 to
20 s in the preferred embodiment of the invention. At the end
of this period, the white noise level of the generated sample
packets (40) at the output access node is recalculated from the
current voice samples arriving at the input of the network. At
the output node, the level of the white noise samples generated
is progressively readjusted to the new level value during a
third defined period of time (210). In a preferred embodiment
of the invention, the third period is set to 80 ms.

The network packets sent over the network, represented by 30 in
figure 3, illustrate the case where the calculation of
refreshed white noise level values from the current background
noise is performed at the input node and a control packet
containing the noise energy level is sent from the input node;
a first control packet (E1) is sent when entering a silence,
then a second control packet (E2) with the refreshed level is
sent after the second defined period of time. As mentioned
earlier, the white noise samples generated have a progressive
level adjustment from E1 to E2 during the defined third period
of time (80 ms, i.e. 4 packets), this progressive level
adjustment being calculated by the output node.

Optionally, if the input node detects that the signal received
is still silence and the energy variation of the silence is
greater than a threshold value, it sends a control packet (E2)
before the end of the second period.

In the case where the input node does not send a control
packet, it lets a burst of silence packets cross the network
during the already defined short first period of time (110),
set at 80 ms. In figure 3, each E packet should then be
replaced by 4 silence packets. It is up to the receiving node
to detect these packets and calculate the new level of noise to
be reached by the white noise packets generated at the output
of the network. The output node, as in the first case, also
calculates the linear interpolation to progressively reach the
refreshed level of white noise to be generated.
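The progressive adjustment from the old level to the refreshed level can be sketched as a plain linear interpolation over the packets of the third period. The function name is hypothetical, and the default of 4 steps assumes 20 ms packets over the 80 ms third period.

```python
def interpolate_levels(old_level, new_level, steps=4):
    """Return one level per packet, moving linearly from old_level to new_level.

    The last returned value equals new_level, so the refreshed level is
    reached exactly at the end of the adjustment period.
    """
    return [old_level + (new_level - old_level) * k / steps
            for k in range(1, steps + 1)]


# Example: the level is refreshed from E1 = 10 to E2 = 30.
print(interpolate_levels(10.0, 30.0))   # [15.0, 20.0, 25.0, 30.0]
```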

Figure 4 and Figure 5 represent how packet loss is controlled.
This control, added to the white noise level monitoring just
described, contributes to the quality of the silence removal
solution of the present invention. In figures 4 and 5, the
three packet streams (250, 260, 270) respectively correspond to
the network packets sent over the network by the network input
node, the one received by the network output node, and the one
reconstructed and sent by the network output node. Figure 4
shows the case where one network packet has been lost when
crossing the network: 270 indicates that the output node
receiving the packets generates packet 4 in the final output
packet stream; this hides the error and optimizes the quality
of the restored voice. Figure 5 illustrates the case where
consecutive network packets (more than one) are lost: the
preceding network packet (4) will be generated once. After
that, until the next real packet is received, white noise
packets (w) will be generated at a determined level (around
-45 dB is convenient). Interpolation techniques can be used to
smooth the restored voice samples and avoid signal
discontinuities.
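The loss handling of figures 4 and 5 can be sketched as follows, under the assumption that the receiver knows, for each packet slot during speech, whether a packet arrived (represented here by None for a lost packet). The function name is hypothetical and the smoothing by interpolation is left out.

```python
def conceal_losses(received, noise_packet):
    """Rebuild the output stream from received packets (None marks a loss).

    The first packet of a loss run is replaced by a copy of the last
    packet actually received; any further lost packets are replaced by
    a white noise packet at a fixed level.
    """
    output = []
    last_good = None
    lost_run = 0
    for packet in received:
        if packet is not None:
            output.append(packet)
            last_good = packet
            lost_run = 0
            continue
        lost_run += 1
        if lost_run == 1 and last_good is not None:
            output.append(last_good)        # figure 4: duplicate previous packet
        else:
            output.append(noise_packet)     # figure 5: fall back to white noise
    return output


print(conceal_losses(["3", "4", None, None, None, "8"], "w"))
# ['3', '4', '4', 'w', 'w', '8']
```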

The preferred embodiment of the present invention is a software
implementation of the silence removal function. The detection
of speech, which consists in reading the input packets and
identifying the 'silence packets' or 'speech packets', is
performed by a 'Voice Activity Detector' (VAD). Any of the
existing VAD algorithms can be used for the silence removal
function.

The silence removal function implemented in the input node is
described, in the case where the method uses control packets,
in the flow chart of figure 6. The silence removal program of
the present invention also includes, in the input node,
instructions related to the white noise level refresh as
described in figure 3. In the output node, still when the
method uses control packets, the silence removal program is run
as described in the flow chart of figure 7. The white noise
level refresh instructions of the silence removal function are
described in figure 3.

The silence removal program operating in the input node, in the
case where the method does not use control packets, is
described in figure 8. The silence removal program of the input
node also includes instructions related to the white noise
level refresh as described earlier in the comments of figure 3.
The principle, in the case where the program does not use
control packets, is to repeat in the output node the end of
speech detection and the white noise level calculation which
are implemented in the input node. The flow chart of the
silence removal program of the output node is described in
figure 9. The program also includes the white noise level
refresh instructions as described earlier in the comments of
figure 3.

Figure 6 is the flow chart of the part of the silence removal
function performed in the input network node (the transmitting
side) when control packets are used. This flow chart
corresponds to the principle already discussed in the comments
of figures 2 and 3. It is noted that there are two possible
states: 'silence state', where the silence removal program is
run, or 'speech state', where the 'normal' voice transport
processing applies; this voice processing can, for instance,
include voice compression. In 300 the input voice packet
arriving on the input line is read, and test 310 is for silence
or speech voice sample detection. This test is performed by the
VAD function. If the packet received is a silence packet
(branch YES), the following test 315 determines whether it is a
silence packet corresponding to the first timer period (110) of
figure 2 or whether said first period timer has expired (branch
YES). On branch YES, test 330 checks whether the packet read is
a first silence packet or whether the state was already a
'silence state'. If the state was already a 'silence state'
(YES branch), the expiry of the second period timer (200) is
checked (340). As explained in the comments of figure 3, the
second period of time corresponds to the white noise level
refresh principle. If the second period timer has expired (YES
branch), this means that the level of white noise which is
generated at the output of the network during silence state
needs to be refreshed; thus, a control packet is prepared and
sent: the new white noise level is calculated and included in a
control packet (350). The control packet format depends on the
voice coding. The second period timer is started (360); a
control packet including the new level of white noise is sent
over the network by the input node (370) and the next packet is
read (300). In test 340, if the second period of time has not
expired (NO branch), the next packet is read (300). If the
answer to test 330 is NO, this means that the silence packet
received is a first silence packet received after a speech
state, and the silence state is entered (380); the new white
noise level is calculated (385) from the background noise of
the last voice packet received; a second period timer is
started (390); a control packet including the new level of
white noise is sent over the network by the input node (395)
and the next packet is read (300). If, after test 310, we know
that the packet received is a speech packet, the first period
timer is started (317) to prepare for the case where this
speech packet is the last one before the end of speech (this
step has no effect in the other cases). Then the packet read is
sent over the network (320). At this step, voice processing can
optionally include a step of voice compression; the voice state
is entered (or confirmed) (325) and the next packet is read
(300).
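The transmit-side flow just described can be sketched as a small state machine. This is a simplified reading, not the program of the invention: timers are replaced by packet counters (4 packets for the first period, 500 for a 10 s second period, assuming 20 ms packets), the VAD and the level estimator are passed in as hypothetical callables, the level is measured over the most recent silence packets, and silence packets arriving while the first timer is still running are assumed to be sent like voice packets.

```python
from collections import deque


class SilenceRemovalSender:
    def __init__(self, is_silence, measure_level, first_period=4, second_period=500):
        self.is_silence = is_silence          # hypothetical VAD callable
        self.measure_level = measure_level    # hypothetical level estimator
        self.first_period = first_period
        self.second_period = second_period
        self.state = "speech"
        self.silent_run = 0                   # stands in for the first timer
        self.since_control = 0                # stands in for the second timer
        self.recent_silence = deque(maxlen=first_period)

    def step(self, packet):
        """Process one input packet; return the list of items to transmit."""
        if not self.is_silence(packet):               # test 310: speech
            self.silent_run = 0                       # 317: restart first timer
            self.state = "speech"                     # 325
            return [("voice", packet)]                # 320
        self.silent_run += 1
        self.recent_silence.append(packet)
        if self.silent_run <= self.first_period:      # test 315: timer running
            return [("voice", packet)]                # assumed: trailing silence is sent
        if self.state == "speech":                    # test 330: first removed packet
            self.state = "silence"                    # 380
            self.since_control = 0                    # 390
            return [("control", self.measure_level(list(self.recent_silence)))]
        self.since_control += 1
        if self.since_control >= self.second_period:  # test 340: refresh due
            self.since_control = 0                    # 360
            return [("control", self.measure_level(list(self.recent_silence)))]
        return []                                     # silence removed, nothing sent


sender = SilenceRemovalSender(lambda p: p < 10, lambda ps: sum(ps) / len(ps),
                              first_period=4, second_period=2)
sent = [item for p in [50, 50, 2, 2, 2, 2, 2, 2, 2, 60] for item in sender.step(p)]
```

In this toy run each packet is a single number standing for its level, so the seven silence packets produce four transmitted trailing packets, one control packet on entering silence, and one refresh control packet.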

Figure 7 is the flow chart of the part of the silence removal
program located in the output network node (the receiving side)
when control packets are used. This flow chart corresponds to
the principle explained earlier in figures 2, 3, 4 and 5. It is
noted that in the output node, three states are possible:
speech, silence or 'packet lost' states. In the output node,
when a packet is received (test 500, YES branch), it is tested
whether or not it is a control packet (test 510). If it is not
a control packet, this means that it is a speech voice packet;
the speech state is entered (or confirmed) (515) and the voice
packet is played on the output line (520) by the output
program. Optionally, the voice processing system can include at
this step a step of voice decompression. The next packet is
read (500). If the answer to test 510 is YES, this means that a
control packet has been received. The silence state is then
tested (525) and if the answer is NO, this means that this
control packet is to advise of a silence in the voice signal:
the silence state is entered (530) and the noise level read in
the packet is stored (535) and played on the output line in a
white noise sample (540); the next packet is read (500). If the
answer to test 525 is YES, this means that this control packet
is to refresh the white noise level of the samples played by
the output node; the new noise level is then stored (545) and
an intermediate noise level is calculated (550): it is
calculated with a linear interpolation algorithm so that the
white noise level changes progressively when it has been
refreshed; with this algorithm, the progressive levels are
calculated to reach the refreshed value over a given period of
time (said third defined period of time); the process goes on
with step 535. If the answer to test 500 is NO, this means that
no packet has been received: the state is tested (555). If the
state is silence state, this means that a white noise packet
has to be generated and played at a level coming either from
interpolation or just from the value read in the last control
packet; the packet is played on the output line (560) before
reading the next packet (500). If the state is not silence
state (branch NO of test 555), and if the state is not 'packet
lost' (branch NO of test 565), this means that a speech packet
should have been received and that a first packet is lost: the
state 'packet lost' is set and the previous packet is
duplicated and played on the output line (575) before reading
the next packet (500). If the answer to test 565 is YES, this
means that it is not the first packet lost and a white noise
packet is generated and played at a given level (-45 dB is
convenient) (580) before reading the next packet (500).
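The receive-side flow can be sketched in the same spirit. This is a minimal illustration with the interpolation step (550) deliberately left out: a refreshed level is applied immediately. The class name, the tuple format of the received items and the `make_noise` callable are assumptions made for the example.

```python
class SilenceRemovalReceiver:
    def __init__(self, make_noise, fallback_level=-45):
        self.make_noise = make_noise          # hypothetical noise generator callable
        self.fallback_level = fallback_level  # level used after repeated losses
        self.state = "speech"
        self.level = None
        self.last_voice = None

    def step(self, item):
        """item is None when nothing arrived in this packet slot,
        otherwise ("voice", payload) or ("control", level)."""
        if item is not None:                              # test 500: packet received
            kind, payload = item
            if kind == "voice":                           # test 510: not a control packet
                self.state = "speech"                     # 515
                self.last_voice = payload
                return payload                            # 520
            self.state = "silence"                        # 530 (or refresh, 545)
            self.level = payload                          # 535
            return self.make_noise(self.level)            # 540
        if self.state == "silence":                       # test 555
            return self.make_noise(self.level)            # 560
        if self.state != "lost" and self.last_voice is not None:   # test 565
            self.state = "lost"
            return self.last_voice                        # 575: duplicate previous packet
        self.state = "lost"
        return self.make_noise(self.fallback_level)       # 580


receiver = SilenceRemovalReceiver(lambda level: ("noise", level))
stream = [("voice", "a"), ("control", 7), None, ("voice", "b"), None, None, ("voice", "c")]
played = [receiver.step(item) for item in stream]
```

In this run the gap after the control packet is filled with noise at level 7, the first gap after "b" replays "b", and the second gap falls back to noise at the fixed level.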

Figure 8 is the flow chart of the part of the silence removal
program located in the input network node (the transmitting
side) when control packets are not used: this is the case of
ATM networks when using AAL1. This flow chart corresponds to
the principle already discussed in the comments of figures 2
and 3. In 600 the input packet is read and test 610 is for
silence or voice detection; this is performed by the VAD. If
the packet received is a silence packet (branch YES), the
following test 615 determines whether it is a silence packet
corresponding to the first timer period (110) of figure 2 or
whether said first period timer has expired (branch YES). If
YES, test 630 checks whether the packet read is a first silence
packet (branch NO), or whether the state was already a silence
state (branch YES):
if NO, the silence state is entered (680) and a second timer is
started (690). If YES, it is tested at 640 whether the second
timer has expired. If it has expired, the first and second
timers are started (660), and the process is resumed at 600, so
that a burst of silence packets will be sent at step 620. If
the timer has not expired, the process is resumed at 600.

If the answer to test 610 is NO, we know that the packet
received is a voice packet; the first period timer is started
(617) to prepare for the case where this voice packet is the
last one before the end of speech (this step has no effect in
the other cases). Then the packet read is packetized and sent
over the network (620); at this step, voice processing can
optionally include a step of voice compression; the next packet
is then read (600).
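Without control packets, the transmit side only decides which packets to let through. The sketch below is one simplified reading of that flow: timers are again approximated by packet counters, the second counter only advances on removed packets, and the generator name and `is_silence` callable are hypothetical.

```python
def sender_without_control(packets, is_silence, first_period=4, second_period=500):
    """Yield only the packets that are actually transmitted."""
    silent_run = 0          # stands in for the first timer
    since_refresh = 0       # stands in for the second timer
    in_silence = False
    for packet in packets:
        if not is_silence(packet):            # test 610: voice packet
            silent_run = 0                    # 617
            in_silence = False
            yield packet                      # 620
            continue
        silent_run += 1
        if silent_run <= first_period:        # test 615: first timer still running
            yield packet                      # 620: part of the silence burst
            continue
        if not in_silence:                    # test 630: first removed packet
            in_silence = True                 # 680
            since_refresh = 0                 # 690
            continue
        since_refresh += 1
        if since_refresh >= second_period:    # test 640: refresh due
            silent_run = 0                    # 660: restart both timers so the
            since_refresh = 0                 # next silence packets form a burst


stream = [1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1]
sent = list(sender_without_control(stream, lambda p: p == 0,
                                   first_period=4, second_period=2))
print(sent)   # [1, 0, 0, 0, 0, 0, 0, 0, 0, 1]
```

The output contains the first speech packet, the initial burst of four silence packets, a second burst of four packets sent for the refresh, and the final speech packet; the other silence packets are removed.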



Figure 9 is the flow chart of the part of the silence removal
program located in the output network node (the receiving side)
when control packets are not used: this is the case of ATM
networks, when the AAL1 protocol is used. This flow chart
corresponds to the principle explained earlier in figures 2 and
3. It is noted that in the output node, four states are
possible: speech, silence, 'refresh white noise' or 'packet
lost' states. In the output node, if no packet is received
(test 700, NO branch), the process goes on with step 555 in the
flow chart of figure 7. The only difference is that the white
noise level of the white noise packet played on the output line
(560) has not been read in a control packet sent by the input
node but has been calculated in the output node. If the answer
to test 700 is YES, this means one packet has been received,
and the speech state is then set (710). At this step the voice
processing can optionally include a step of voice
decompression. It is then tested whether the packet is a
silence packet or not (720). If NO, the 'white noise refresh'
state is set (725) and a first period timer is set (730) to
prepare the burst of packets which will be used to calculate
the white noise level. Then the packet is played by the output
node (735) and the next packet is read (700). If the answer to
test 720 is YES, this means that a silence packet has been read
and the white noise level is calculated (740); if the first
period timer has not yet expired (branch NO of test 745), the
silence state is set (750) and the refresh white noise level
state is simultaneously set (755); then a white noise packet
with the white noise level calculated is sent by the output
node (760); then the next packet is waited for in 700. If the
answer to test 745 is YES, this means that the first period
timer has expired, and if the 'refresh' state is set (branch
YES of test 765), a new white noise level is calculated (770)
with a chosen linear interpolation algorithm (many are
possible); then the silence state is set (775) and the white
noise sample is played (760). If the answer to test 765 is NO,
this means that the refresh state is not set and thus the white
noise level does not have to be interpolated before generating
and playing the white noise packet (760). In the output node,
if no packet is received (test 700, NO branch), the process
goes on with step 555 in the flow chart of figure 7. The only
difference is that the white noise level of the white noise
packet played by the output node (560) has not been read in a
control packet sent by the input node but has been calculated
(step 740) in the output node.

The implementation in the telecommunications access nodes of
processing functions such as voice compression is done on a
dedicated voice traffic adapter card. Figure 10 illustrates the
components related to silence removal in the dedicated voice
processing adapter card (400): the program instructions are
stored in a first RAM storage unit (410) and are read via the
bus 440 by a Digital Signal Processor (450) or a pool of DSPs,
which stores the data needed for instruction execution in a
second RAM storage unit 420, which it accesses through the bus
430. Input voice packets arrive in the DSP through the input
serial link 460 and are sent out through the output serial link
470. The DSP 450 can also be used on the voice dedicated
adapter card to run the voice compression algorithms. The
adapter card also contains the components for the usual routing
functions of an adapter card, the input and output serial links
(460, 470) being the means to communicate with these
components.

Claims 

1. A method for transporting a stream of packets
between a transmitting side and a receiving side
through a communication network, said stream
including voice packets corresponding to speech
periods and silence packets corresponding to
silence periods, characterized in that it comprises:

at the transmitting side, detecting the ends of
speech periods and transmitting only the voice
packets corresponding to the speech periods
followed by the silence packets following the
speech periods for a first period of time,

at the receiving side, reconstituting said stream
by interleaving between the voice packets
received from the transmitting side, white noise
packets corresponding to background noise of
the preceding voice packets.

2. The method according to claim 1, characterized in
that it comprises the steps of:

at the transmitting side:

detecting the end of speech periods,

transmitting the silence packets during the
first period of time after the end of speech
periods,

calculating at the end of speech periods,
a white noise level corresponding to the
background noise of at least one preceding
packet,

transmitting to the receiving side, a control
packet including said white noise level,

at the receiving side:

receiving the voice, silence and control
packets,

reading the white noise level in the control
packet to generate the white noise packets
which are interleaved between the voice
packets received from the transmitting
side.



3. The method according to claim 1, characterized in
that it comprises:

at the transmitting side:

detecting the end of speech periods,

transmitting the silence packets during the
first period of time after the end of speech
periods,

at the receiving side:

detecting the end of speech periods in the
received packets,

calculating a white noise level from the
background noise of at least the last
received packet, and generating the white
noise packets to be interleaved between
the voice packets which are received from
the transmitting side.

4. The method according to claim 2 characterized in 
that it comprises at the receiving side: 

if no packet is received and if the last received
packet is not a control packet, repeating in the 
packet stream reconstituted by the receiving 
side, the last received packet at least once. 

5. The method according to claim 3 characterized in 
that it comprises at the receiving side: 

if no packet is received and no end of speech is
detected, repeating in the packet stream recon- 
stituted by the receiving side, the last received 
packet at least once. 

6. The method according to claim 2 or 4, characterized
in that it comprises at the transmitting side:

starting a timer having a second period of time
at the end of the first period,

when the timer is expired and if no voice packets
are to be transmitted, calculating a new
white noise level corresponding to the background
noise of the silence period and sending a
new corresponding control packet which is
used by the receiving side for generating
refreshed white noise packets.

7. The method according to any one of claims 3 and 5
characterized in that it comprises:

at the transmitting side:

starting a timer having a second period of
time at the end of the first period,

when the timer is expired and if no voice
packets are to be transmitted, sending a
burst of silence packets,

and at the receiving side:

receiving the burst of silence packets and
calculating a new white noise level which is
used for generating refreshed white noise
packets.

8. The method according to any one of claims 6 or 7,
characterized in that it comprises:

calculating with a linear interpolation algorithm
and for a third period of time, the progressive
values of the next white noise packets to be
generated.

9. A system for transporting a stream of packets
between a transmitting side and a receiving side
through a communication network, said stream
including voice packets corresponding to speech
periods and silence packets corresponding to
silence periods, said system comprising means for
carrying out the methods according to any one of
claims 1 to 7.
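
As an informal illustration of the transmitting side of claim 1, the selection of packets to send can be sketched as follows. This is a minimal sketch in Python under stated assumptions, not the claimed implementation: the first period of time is expressed as a number of packets, the silence detector is supplied by the caller, the start of the stream is treated like the end of a speech period, and the names select_packets, is_silence and hangover_packets are hypothetical. The periodic refresh of claims 6 and 7 is not modelled.

```python
def select_packets(packets, is_silence, hangover_packets):
    """Return the packets the transmitting side would actually send.

    Voice packets are always sent. Silence packets are sent only
    while they fall within the first `hangover_packets` packets
    that follow a speech period; later silence packets are dropped.
    """
    sent = []
    silent_run = 0  # consecutive silence packets since the last voice packet
    for packet in packets:
        if not is_silence(packet):
            silent_run = 0
            sent.append(packet)
        else:
            silent_run += 1
            if silent_run <= hangover_packets:
                sent.append(packet)
    return sent


# Usage: packets are labelled "v" (voice) or "s" (silence) for clarity.
stream = ["v1", "s1", "s2", "s3", "v2", "s4"]
sent = select_packets(stream, lambda p: p.startswith("s"), 2)
# sent == ["v1", "s1", "s2", "v2", "s4"]
```

The silence packets kept after each speech period are what allow the receiving side to estimate the background noise before it starts generating white noise packets.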
